Lin et al. SpringerPlus 2014, 3:486 
http://www.springerplus.eom/content/3/1/486 



0 SpringerPlus 

a SpringerOpen Journal 



RESEARCH Open Access 



Identification, cross-taxon transferability and 
application of full-length cDNA SSR markers in 
Phyllostachys pubescens 

Yuan Lin 1 , Jiang-Jie Lu 1 , Miao-Dan Wu 1 , Ming-Bing Zhou 1 , Wei Fang 1 , Yuji Ide 2 and Ding-Qin Tang 1,2 * 



Abstract 

Current databases of Phyllostachys pubescens full-length cDNAs (FL-cDNAs) provide a rich source of sequences for 
the development of potential FL-cDNA simple sequence repeat (SSR) markers. We screened 10,608 P. pubescens 
cDNAs, discovering 1614 SSRs in 1382 SSR-containing FL-cDNAs. The SSRs were more abundant within transposable 
elements (TEs) than expressed sequence tags (ESTs) and genome survey sequences (GSSs), and specific dinucleotide 
repeats tended to associate with particular TE families: (TA) n with En/Spm and (CT) n with Mutator. A selected panel 
of 100 FL-cDNAs containing type I SSRs yielded 68 functional SSR markers with an average polymorphism information 
content (PIC) value of 0.12, among which 22 loci contained polymorphisms. These markers became less transferable 
(83.1 % -> 69.9% ->■ 49.3%) but more polymorphic (79.4% -> 92.3% -» 92.8%) with increasing phylogenetic distance 
(intra-genus — > intra-subtribe — ► intra-family). Transferability and polymorphism also depended on the location of the 
marker, with those located in the coding region being more transferable (69.1%) and less polymorphic (89.4%) than 
those in the 5'-UTR (63.4% transferable, 90.7% polymorphic) and the 3'-UTR (61 .8% transferable, 91 .4% polymorphic). 
As proof of principle, we were able to use our FL-cDNA SSR markers to identify the parental stocks in interspecific 
hybrids of bamboo within and beyond P. pubescens, and estimate the outcrossing rate for P. pubescens. Our research 
should facilitate molecular breeding in bamboo species where original genetic markers are scarce. 

Keywords: Phyllostachys pubescens {edulis); Microsatellite (SSR); Cross-taxon transferability /polymorphism; Hybrid 
identification; Outcrossing-rate estimation 



Background 

Bambusoideae is a subfamily of the grass family Poaceae 
and is further divided into nine subtribes comprising 
more than 80 bamboo genera and about 1400 species 
worldwide. Fifty genera and more than 500 species are 
found in China, among which Phyllostachys pubescens 
(synonym: P. edulis) is commercially the most important 
species providing the third largest source of timber and 
the most predominant source of bamboo shoots. P. pub- 
escens plantations cover an area of 3 million ha (approxi- 
mately 2% of the total forest area), which has doubled 
over the last 30 years and taken on a more important 
ecological role (Fu 2001). Compounds extracted from 
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P. pubescens have recently shown potential for the treat- 
ment of obesity and other diseases (Higa et al. 2012). 
However, various problems associated with P. pubescens 
plantations including its simultaneous flowering intervals 
of more than 60 years and recovers from a limited number 
of clones (Janzen 1976; Watanabe et al. 1982). Addition- 
ally, the little knowledge of its basic biology, genetics and 
breeding system bring about the practical difficulties asso- 
ciated with the identification and characterization of su- 
perior genotypes. 

Molecular markers developing from microsatellites, 
also known as simple sequence repeats (SSRs) with 
characterization of high genome coverage, random 
dispersion, co-dominant inheritance, reproducibility and 
amenability to automation in high throughout genotyping, 
have gained considerable spotlight recently. By now, micro- 
satellite markers have been developed for several other 
bamboo species, e.g. six loci for Bambusa arundinacea 
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(Nayak and Rout 2005), eight loci for Sasa senanensis 
(Miyazaki et al. 2009) and eight loci for S. cernua (Kitamura 
and Kawahara 2009). We identified 19 GenBank microsat- 
ellite markers in P. pubescens and related species (Tang 
et al. 2010), and 15 expressed sequence tag (EST) SSR 
markers for Bambusa species (Dong et al. 2011). Recently, 
the Bamboo Full-Length cDNA Project (Peng et al. 2010) 
has generated a vast amount of publicly-available 
P. pubescens cDNA sequence data that can be used for 
gene discovery, comparative genomics/transcriptomics 
and marker development. Microsatellites derived from 
cDNAs or ESTs are highly transferable to closely related 
species (Zhang et al. 2005) facilitating the development 
of gene-based maps that may increase the efficiency of 
marker-assisted selection through the use of candidate 
genes (Rossi et al. 2003; Lu et al. 2006). 

Here, we report the use of P. pubescens full-length 
cDNA (FL-cDNA) sequences to 1) analyze the association 
between SSRs and transposable elements (TEs) in the 
transcriptome; 2) develop and validate FL-cDNA SSR 
markers and determine their transferability to other bam- 
boo species; and 3) apply the polymorphic SSR markers 
to estimate outcrossing rates in P. pubescens and identify 
bamboo interspecies hybrids. 

Results and discussion 

Association between SSRs and TEs in the P. pubescens 
transcriptome 

We analyzed 10,608 P. pubescens FL-cDNA sequences 
available in NCBI GenBank, representing -7171 kb of 
DNA. EST-trimmer was used to remove poly(A/T) runs, 
and the remaining sequence data were screened using 
MISA, identifying 2330 SSRs in 2014 cDNAs, the 
remaining cDNAs lacking SSRs. The sequences were 
clustered with CAP3, reducing the collection to 1614 
non-redundant SSRs in 1382 cDNA contigs (Additional 
file 1: Figure SI). Peng et al. (2010) described the distri- 
bution of SSRs in the P. pubescens transcriptome in detail. 
Therefore, we selectively analyzed the non-redundant 
cDNA sequences and contigs with RepeatMasker to deter- 
mine the association between SSRs and TEs because pre- 
vious reports have shown that many SSRs are located in 
TEs (Richard et al. 2008), e.g. 50% of SSRs in the human 
genome (Scherer 2008), and that SSRs are closely associ- 
ated with TEs in rice (Akagi et al. 2001; Temnykh et al. 
2001) and barley (Wei et al. 2002). The results revealed 95 
TEs, representing 13.52 kb (0.27%) of the total cDNA se- 
quence data. Further analysis showed that 29 TEs con- 
tained a total of 39 SSRs, accounting for 822 bp (6.41%) of 
the total TE DNA sequences. In comparison, the non- 
redundant EST sequence data (7089 cDNAs refined from 
the original 10,608 sequences) contained 1614 SSRs, ac- 
counting for 2.60% of the total cDNA sequences in length. 
Therefore, SSRs were approximately 2.5-, 65.4-fold more 



abundant in TEs compared to cDNAs (Table 1) and whole 
genome (0.098% based on the analysis of genome survey 
sequences (GSSs; Tang et al. 2010)). It is possible that 
SSRs within TEs are also involved in the regulation of 
gene expression (Tomilin 2008). 

Some studies have also suggested associations between 
specific SSR motifs and particular TE families, e.g. (TA) n 
is often found in the 5'-UTR of Micron element trans- 
posase genes in rice (Akagi et al. 2001; Temnykh et al. 
2001). We also investigated the distribution of SSRs 
among DNA transposons, and found they were most 
likely to occur in En/Spm elements (33.87% of the total 
En/Spm DNA sequence). Six SSRs were found in five 
En/Spm elements, with one element containing two 
SSRs (Table 2). Mutator elements were the next most 
likely to contain SSRs (14.19% of the total Mutator DNA 
sequence). Thirteen SSRs were found in 12 Mutator ele- 
ments, again with one element containing two SSRs 
(Table 2). The situation was very different among retro- 
transposons, with only 0.30% of the total Tyl-copia 
DNA sequence and 0% of Ty3-gypsy DNA sequence 
made up of SSRs. More detailed investigation of specific 
repeat motifs showed that four of the six SSRs found in 
En/Spm elements were TA/AT repeats, and 10 of the 13 
SSRs found in Mutator elements were CT/AG repeats. 
All 13 of the Mutator SSRs and six of the En/Spm SSRs 
were located in the 5 -UTR. It has been reported that 
TE molecular markers (mPing) showed significantly 
higher levels of polymorphism than all other molecular 
markers in closely-related rice cultivars (Monden et al. 
2009). Considering that it is difficult to detect genetic 
variation in P. pubescens using ordinary markers (Lin 
et al. 2009; Tang et al. 2010), SSRs in TEs therefore ap- 
pear to be promising markers for bamboo species. 

Development and polymorphism assessment of FL-cDNA 
SSR markers for P. pubescens 

Original collection of 10,680 P. pubescens FL-cDNA 
sequences produced 1382 cDNA contigs containing SSRs. 
Sequences containing mononucleotide repeat motifs 
were excluded, leaving 1051 cDNA sequences contain- 
ing SSRs with 2-6 nt repeats motifs (Additional file 1: 
Figure SI). Following the procedure already adopted 
for rice (Temnykh et al. 2001). We were able to design 
primer pairs for 583 (55%) of these cDNAs, the remainder 
offering either insufficient flanking DNA (over half of the 
SSRs were found in the 5 ' or 3 ' UTRs) or flanking DNA 
that was unsuitable for primer design. Only 325 (24.1%) 
of the SSRs were type I repeats (>20 bp), which offer 
greater potential for marker development. The 100 most 
promising sequences were selected for PCR validation, 
including dinucleotide repeats with >12 repeat units, 
trinucleotide repeats with >8 repeat units, tetranucleo- 
tide repeats with >6 repeat units, pentanucleotide and 
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Table 1 Association between FL-cDNA SSRs and transposable elements (TEs) in P. pubescens 



TE family 


No. 


L&ncjth 
(bp) 


No. of 
TE-SSR 


No. of 
SSR-TE 


TE-SSR/SSR-TE (%) 
(in length ) 


1 nt 


No. of SSRs with repeat units of: 
2 nt 3 nt 4 nt 5 nt 


6 nt 


Total TEs 


95 


13522 


36 


29 


6.08 


1 


31 


4 


0 


1 


0 


En/Spm 


8 


815 


6 


b 


33.87 


0 


5 


1 


0 


1 


0 


Mutator 


28 


2931 


13 


12 


14.19 


1 


10 


2 


0 


0 


0 


Ty1 -copia 


20 


4010 


1 


1 


0.3 


0 


0 


1 


0 


0 


0 


Ty3-gyspy 


1/ 


2913 


0 


0 


0 


0 


0 


0 


0 


0 


0 


Other TEs 


22 


2853 


16 


11 


8.70 


0 


16 


0 


0 


0 


0 


EST 


7089 


4942281 


1614 


N.A. 


2.60 


271 


489 


789 


30 


14 


21 



hexanucleotide repeats with >5 repeat units and some 
compound SSRs with >24 repeats (Table 3). We found 
that 32 of the selected cDNAs were unsuitable because 
the PCR failed to generate a product (four cDNAs) or gen- 
erated products lacking SSRs (28 cDNAs), but the 
remaining 68 sequences allowed the development of FL- 
cDNA SSR markers (Table 3). These contained 18 com- 
pound SSRs, 19 dinucleotide repeats, 18 trinucleotide 
repeats, four tetranucleotide repeats, three pentanucleo- 
tide repeats and six hexanucleotide repeats. Interestingly, 
although 45 of the cDNAs (66.2%) generated the an- 
ticipated PCR product, 16 (23.5%) generated products 
with more repeats than expected, five (7.4%) generated 



Table 2 Distribution of SSRs in En/Spm and Mutator 
transposons 



ID 


SSR motifs 


Length (bp) Starting 


Ending 


Location 


SSR distribution in En/Spm transposons 


FP091991 


(GAGGA) 6 


30 


109 


138 


CDS 


FP091422 


(TA) 22 (CA) 9 


62 


12 


73 


5'UTR 


FP097776 


(TA) 23 


46 


1 


46 


5'UTR 


FP 100462 


(TA) 31 


62 


14 


75 


5'UTR 


FP100841 


(CGG) 6 


18 


38 


55 


5'UTR 


FP 100858 


(AT) 29 


58 


22 


79 


5'UTR 


SSR distribution in Mutator transposons 


FP 100733 


(TQ 8 -(GGC) 5 


89 


20 


108 


5'UTR 


FP 100664 


(AG) 17 


34 


32 


65 


5'UTR 


FP094905 


(CT) 17 


34 


1 


34 


5'UTR 


FP099988 


(CT) 19 


38 


1 


38 


5'UTR 


FP094782 


(CT), 2 


24 


/ 


30 


5'UTR 


FP099842 


(CT) 15 


30 


4 


33 


5'UTR 


FP091749 


(CT) 23 


46 


2 


4/ 


5'UTR 


FP093400 


(GA),e 


32 


23 


54 


5'UTR 


FP096707 


(GAA) 8 


24 


36 


59 


5'UTR 


FP096801 


UQ 8 


16 


2 


17 


5'UTR 


FP099127 


(AG) 18 


36 


40 


75 


5'UTR 


FP099725 


(Ql3 


13 


1 


13 


5'UTR 



products with fewer repeats than expected, and two 
(PBM050 and PBM055) generated products with dif- 
ferent repeats and flanking sequences than those an- 
ticipated. The unanticipated amplification resulted in 
three SSR markers (PBM036, PBM055 and PBM 077) 
containing type II repeats (12-19 bp in length) and 
one marker (PBM079) shorter than 12 bp. In total, 67 
sequences were deposited in GenBank (accession nos 
GU644371-GU644438). Based on BLASTX analysis, pu- 
tative functions were assigned to most (66.2%) of the 
cDNA sequences with significant similarity to known pro- 
teins, whereas 27.9% matched unknown/hypothetical 
proteins and 5.9% were novel sequences (Table 3). 

One hundred and seven primer pairs finally yielded 68 
FL-cDNA SSR markers for P. pubescens, which is to- 
wards the lower end of the 60-90% success rate previ- 
ously reported in sugarcane (Cordeiro et al. 2001), barley 
(Thiel et al. 2003), wheat (Yu et al. 2004) and peanut 
(Liang et al. 2009). Squirrell et al. (2003) defined the suc- 
cessive loss of sequenced fragments and designed primers, 
until arriving at a final collection of "working SSRs" pro- 
ducing discrete bands of the expected size, as the "attrition 
rate". Kofler et al. (2008) reported a high attrition rate 
when developing SSR markers from enriched libraries, 
BAC-end sequences and ESTs in rye, possibly reflecting 
the large number of TEs in the rye genome. Tero et al. 
(2006) found that the number of SSR markers was re- 
duced when the markers were predominandy located 
within TEs. Squirrell et al. (2003) suggested that SSR 
marker development would be challenging in polypoid 
species and species such as wheat and rye with large num- 
bers of TEs. P. pubescens has 2n = 48 chromosomes and is 
thought to be tetraploid (Li et al. 1999). The genome is > 
2000 Mb, which is approximately 5.4 times larger than 
diploid cultivated rice (Gui et al. 2007), and it contains a 
large number of TEs (Zhong et al. 2010; Zhou et al. 
2010a, b, c). The slightiy higher attrition rate we encoun- 
tered therefore seems reasonable when considering the 
chromosomal polyploidy, size and TE content of the 
genome. We also encountered a higher attrition rate 
in B. oldhamii (Li et al. 2001), a hexaploid bamboo 
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Table 3 Characteristics of the P. pubescens SSR markers derived from FL-cDNAs 



No. Marker Accession no. 
Name cDNA SSR 



Primer sequence (5'— > 3') 



Tm 



PCR 



Motif „ n fragment 

{ ' (bp) 



PIC Putative function 



1 PBM031 FP094740 GU644371 



2 PBM032 FP098085 GU644372 



3 PBM033 FP099510 GU644373 



4 PBM034 FP098748 GU644374 



5 PBM035 FP100911 GU644375 



6 PBM036 FP096684 GU644376 



7 PBM037 FP101192 GU644377 



8 PBM038 FP101125 GU644378 



9 PBM039 FP091409 GU644379 



10 PBM040 FP096343 GU644380 



11 PBM041 FP095242 GU644381 



12 PBM042 FP093940 GU644382 



13 PBM043 FP099127 GU644383 



14 PBM044 FP095585 GU644384 



15 PBM045 FP098751 GU644385 



16 PBM046 FP094276 GU644386 



17 PBM047 FP099829 GU644387 



18 PBM048 FP092637 GU644388 



19 PBM049 FP099913 GU644389 



20 PBM050 FP093015 GU644390 



21 PBM051 



FP092618 
(FP099842) 



GU644391 



CGCCGAGTTCCCTATTATTATTT 

AGCACAGCCTCCGTGATTG 

TTTCCCAAATAAAACCTCACC 

GTCCATTTAGGGTTCCACTGA 

CTGACTGTGCGTGCGTCTC 

CTTGGTCTCGCTCATCTCCTC 

TCGGCTCGGCGTGATGGAT 

ATCGGCATCCGCGACTGCC 

ACCGTGATGACTACCGCCGCGACC 

TGCTGCCTCCACCCCTCCGTCC 

CACATGGACCGCCTCATCC 

GCAACAAAACGAGAACCAGAC 

TGCAAGCCTGCTATACGTTT 

GAAGTGGGAGTACATACTFCCCA 

GGTCGGCTCATTTTGTAGTGT 

CAACCTTCAGGCAATAGATTACAT 

CATCCTCAGTTTCTCACCG 

CAGCTTCACCAACTTGTGG 

GAATCATCTGGGAAGAAGAAGGA 

TGCATTGCATTTGGCTTAGTAGT 

TGGTGTTGCCTGTGACCTTAC 

CCCACCTCCACCTCTACTACG 

TCCTTTACGGCTTTACCCC 

GCCCCAGCTTAGTACACCAC 

CTCACCGCCCCACCTCGCA 

CGGCTGCTGATGCGGAGGA 

AAGGCCCACGTTGCCAGAC 

GTTCCCGTTGATGCCCCAC 

TGAGCGAGGTAGTTTCATTTTAGTTA 

CCTACGACGAGTAGATTGCGAGT 

CTCAGAGCAGACACTGCTTATTCC 

GCGTCTTCATTGCAGCCATCT 

ACCACGTTGCAGGATTCACT 

CGATGAGCAGCACAACAGC 

GCAAAAGAGCGCACTTGAC 

GGAGGACACTAGAGTTGGCATT 

ACAGCAGATAGTCCCAAAAT 

GACAGCAGGATGAAGAGCA 

AGTATAGTATGTTCGTTTAAGTGG 

TGTAATGTTTAAGGTTCCGT 

AGACATTGTCAACTGTAAGTTGGTAGAG 

TTTACAAGCAATACACCCAGAAATAG 



(AG) 6 -{AG) 7 56 

(CCG) 7 -(CCT} 6 56 

(CG) 8 (AG) 14 56 

(GAG) 5 (GCG) 5 62 

(GTG) 7 -(GTG) 7 62 
(TA) 8 

<TA) 7 -(TA)e 47 

(TQ 9 (TA) 22 48 

(TC) 12 -(CTT) 6 53 

(FC) 7 -(TQ 7 51 

(TG) 8 (AG), 0 53 

(GA) 7 -(AG) 6 53 
(AG), 3 
(AG) 20 
(CA) 20 

(CTV(C7) 6 
(CT) 13 
(CT) 27 
(GA)„ 
(CA)n 
(TC) 23 



0 MYB-like transcription factor 



143 0 PHD finger protein 



155 0 Small GTP-binding protein 



169 0 GTP binding protein 



165 0.368 U2 snRNP auxiliary factor 



47 1 69 0.259 Polypeptide-associated complex 

alpha subunit-like protein 



130 0 Thaumatin family protein 



210 0.365 GCIP-interacting 

family protein-like 

171 0.355 Unknown protein 



1 78 0 Bicolor hypothetical protein 



167 0 typeA response regulator 1 



156 0.365 SAM and SH3 domain- 

containing protein 1 



60 128 0 IAA1 5 - auxin-responsive 

Aux/IAA family member 

55 173 0.371 Bicolor hypothetical protein 



53 132 0.322 DRE binding factor 



102 0.395 Unknown protein 



53 119 0 Bicolor hypothetical protein 



53 163 0 Serine carboxypeptidase 

1 precursor 

50 117 0.305 Unknown protein 



45 137 0 Oxidoreductase 



50 111 0 VQ motif family protein 
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Table 3 Characteristics of the P. pubescens SSR markers derived from FL-cDNAs (Continued) 



140 



02 



(GGCGGA) 6 62 121 

(TC) 7 53 199 

<CTCCAT) 6 53 1 77 

(ACACAG) 5 59 1 1 1 



22 PBM052 FP095787 GU644392 AGCGGGCAGGCTATGTATT fTCT)„ 

TTGCTTCTCCCCTAATGACA 

23 PBM053 FP094717 GU644393 CCCCATAATCTGCTCCCTTCT (TTC) 10 

GGTTCTTGGCGTATGGTATGTTC 

24 PBM054 FP100158 GU644394 ATCGGGAGGGATGCGGCAGC 

GCGGACCAAGCGGAACACC 

25 PBM055 FP1 00601 GU644395 CATGGATGTTGTTGAGTTGAGGC 

GCACAAAGACTAGTACTCGAGGTGG 

26 PBM056 FP1 00601 GU644396 CATGGATGTTGTTGAGTTGAGGC 

GCACAAAGACTAGTACTCGAGGTGG 

27 PBM057 FP097951 GU644397 CGCCCACCCCTCCTTCGTCT 

TCCTTGGCACGGCCACTCA 

28 PBM058 FP097794 GU644398 GGCCGAGATCCTCCTTTCT (GGCGGT) 5 59 171 

CCATCCCCGCCTTCACCAC 

29 PBM059 FP094127 GU644399 ATTAGTCACGCACCGAGAAGGAA (AGATG) 6 55 1 72 

AGACGCAAGAACTCGACAGGGA 

30 PBM060 FP101691 GU644400 CACGCCAGCTCCAGATGCCACCAT (CACCC) 5 59 1 19 

TGCCCTTCCACCTCCTCTGACCTCC 

31 PBM061 FP095238 GU644401 CCCTATCCCATCCTCCTCCC (CCTCT) 5 55 1 19 

GGTTGCTCACTTTCCTGCTCC 

32 PBM062 FP096136 GU644402 TGCTGGTTGGGTTCATCACGA (TTCT) 7 53 156 

GAGGGTTACAACAGGGGCAAAGA 

33 PBM063 FP098746 GU644403 CAACGCAACGCCATTCCAAACA 

CACCTCCAGGCCCTGGTACTCCA 

34 PBM064 FP099572 GU644404 CATTTCTCATTGCCGCTGTAAC 

TCCTTTGCCCTCCTCTTCCT 

35 PBM065 FP096965 GU644405 GTCAGTCAGGCGGCACGAG 

CGCGTAGGACGAGATCACCTC 

36 PBM066 FP095562 GU644406 CTCTTCACCGAAACCGAAAG iOVj< i~ : " 

CGTTGAGGTTCCTGAGGTAGAC 

37 PBM067 FP098504 GU644407 GGTGCGGGTGCAGTTTATT (CTT) 8 51 185 

AGCATCATCCGCCAGAATA 

38 PBM068 FP093884 GU644408 AACCGTGCACTACTTGCTCT (TCT) 8 51 155 

ACCTTGTGGACGACATGGA 

39 PBM069 FP099427 GU644409 CCCTTTCCCTTCAACAACAA (CCG) 8 57 101 

TACCGATCCATGGCTCCTT 

40 PBM070 FP094239 GU644410 TCGTGCCTTTCGCCTCCTG (TCT) 7 55 1 17 

CTGTACGGCCCGAACTTGTA 

41 PBM071 FP093953 GU64441 1 AGCGTCACCTCCGCCTTCT (CGA) 8 57 101 

TCCTTGGCCTCGTCTTGGT 

42 PBM072 FP093285 GU644412 CCTCCCACTGTCACGGCACC (CTC) 9 59 116 

GGCTGTGGCGACAAGGCTG 

43 PBM073 FP096973 GU644413 AGCAGCTCTACGGCAAGAAGAAG (ATC) 8 53 139 

TGCAGCCTTGAGGAATTGAGAA 



(TCCA) 5 59 1 38 

(GAGT) 5 53 139 

(CG) 5 -(CGG) 9 60 1 83 
(CGG) 9 



0.359 ELF4-like protein 



KN1-type homeobox 
transcription factor 



0.305 Unknown protein 



0 Nonspecific lipid-transfer 
protein 2 precursor 

0 Nonspecific lipid-transfer 
protein 2 precursor 

0 Cp protein 



0 Unknown protein 



0 Transcription elongation 
factor-related protein 

0 Sucrose transporter 



0 Smr domain containing 
protein 

0 Bicolor hypothetical protein 



0 U-box domain 

containing protein 

0 Unknown protein 



0 Bicolor hypothetical protein 

0.477 Spliceosomal protein 

0 RNA-binding protein 

0 pollenless3 mRNA 

0.360 Alba superfamily protein 

0 Bicolor hypothetical protein 

0 Unknown protein 

0 Bicolor hypothetical protein 

0 Bicolor hypothetical protein 
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Table 3 Characteristics of the P. pubescens SSR markers derived from FL-cDNAs (Continued) 



44 PBM074 FP096816 GU644414 CCCACCGAAGTAATCACGC (CTC) 8 

CTCGCACAACAAAAGAAATCA 

45 PBM075 FP096707 GU644415 AGTTTCCTTCTTTCCTTCCTTCCGTGGTG (GAA) 8 

CGGCATTTGCGATTTGTGC 

46 PBM076 FP101632 GU644416 ATGCCTTCACCACACTTAC 

CATCGTGATGTCTCCAATC 

47 PBM077 FP096443 GU644417 CCGCTTCCTCCCACCAAAT 

CGCAGTACAGCAGCTCCCC 

48 PBM078 FP095554 GU644418 CCCAAATCCAACCAGAACCA 

GGAGGAGGCATTCGTAGGAGA 

49 PBM079 FP097911 GU644419 AAGGATGGTAACGTACATACA 

CATGACAAATTTAAAGGTATCA 



(FP092888) 

50 PBM080 FP093425 GU644420 CGAGGTFCTTGGGCTCAGTT 

ACACGCCTCCAATAAAACAAAC 

51 PBM081 FP097485 GU644421 TCTACTCCGTAGCCGCCTFC 

AGAGCCTCCATTGGATGGG 

52 PBM082 FP099753 GU644422 AATTTGTTGCCCTGCCTAGCT 

GCAAGATGAGAAGAATTAAAGCTGC 

53 PBM083 FP101428 GU644423 CCATTTGGCATTTGCTCCC 

GCACCCCGTAGAACCAGTCC 

54 PBM084 FP092513 GU644424 CTTCTCATGGGGTCAGCTACTC 

ATCACTTCTGCGATCTTGGTC 

55 PBM085 FP091409 GU644425 GGGGAGCCATCCTCAGTTT 

GCTGGCAGCTTCACCAACT 

56 PBM086 FP096167 GU644426 GTGGAAAATAAAGAAGCGC 

TTCCTGCTTTTGATCTTGC 

57 PBM087 FP093957 GU644427 ACCCCAAGCATCCCCAAAA 

CCGCAGGGAAGTCGAAGGTC 

58 PBM088 FP091571 GU644428 GTGTATTGGCTTTCCAGCTTTTCC 

TCTCCGCACGCTACTGTCCC 

59 PBM089 FP097920 GU644429 TCCCTTATCCACCAAACACGC 

GCTGGCAACGACGCACCTC 

60 PBM090 FP097267 GU644430 AGAGTCGGATAAGGGTAGCG 

CGATCTCGAAGTTCGTGCC 

61 PBM091 FP1 00553 GU644431 ATAGAGGCATACAGCCGCAGAC 

TAGGCACGGCATCACGGAC 

62 PBM092 FP099642 GU644432 GAACGCCGCATCCAGCCTCT 

GGTCGGGTCCTTGGACAAAC 

63 PBM093 FP1 00738 GU644433 TCGCAGTAAACAGTCTCATCACATC 

TCAGGGCCACCACCTCGTCT 

64 PBM094 FP095169 GU644434 GATTGAGGAGCCCCAAACC 

CACAACAACCGCAAGAGCC 

65 PBM095 FP098630 GU644435 TTATTAGTCGAGTTTGGGTCTCC 

GGTGAACGGCATGGCTGCT 



(GCA) B 

(CCG) 5 59 181 

{CGG) n 59 187 

(AT) 5 44 1 59 

(AG) 13 53 1 16 

(CT) 16 56 135 

(TC) 5 -(TC) I6 53 148 

(GA) 15 59 186 

(TC) 17 (AC) 16 53 201 

(TQi2-(CTT)6 55 183 

(TC) 9 -(TC) 9 51 139 

(CCT) 5 -(CGC) 9 59 166 



55 119 0 Transcription factor 

HBP-1a(c14) 

53 101 0.510 Unknown protein 



51 121 0 AP2/ERF domain protein 



(AG),, 

(CT) 17 
(AG) 12 
(AG), 4 

(JC),3 
(CCT) 8 

(CCG) 8 
(CCT) 8 



0 Bicolor hypothetical protein 



0 Anti-silencing protein 



0 Unknown protein 



0.375 ATP binding protein 



0 Pyridoxamine 5-phosphate 
oxidase 

0 Homeodomain leucine-zipper 
protein Hox8 

0 GTPase SAR1 (Sari .1 ) 



0.369 Brown planthopper-induced 
resistance protein 1 (Bil) 

0.346 Putative precursor micro 
RNA R167h gene 

0 Unknown protein 



0.373 Bicolor hypothetical protein 

„ ... 0 Knotted class 1 

homeodomain protein 

56 1 72 0.369 Bicolor hypothetical protein 

53 106 0.195 Repair protein RAD23 



56 126 0.369 Macrophage migration 

inhibitory factor 

53 155 0 Basic/leucine zipper protein 



59 150 0 Disulfide isomerase (PDIL2-2) 



57 257 0 DUF2372 superfamily protein 



55 115 0.430 Unknown protein 



Lin et al. SpringerPlus 2014, 3:486 
http://www.springerplus.eom/content/3/1/486 



Page 7 of 12 



Table 3 Characteristics of the P. pubescens SSR markers derived from FL-cDNAs (Continued) 



66 


PBM096 


FP100124 


GU644436 


CACTCGGCTCGTCCTCGTCT 
AGGGTGGCTAAGGCTCGTCTC 


(CCTC) 6 


60 


129 


0 


PLAC8 superfamily protein 


67 


PBM097 


FP099849 


GU644437 


CTGCCACTCCATCCCTGCC 
CTCGATGGCGACGGCTGTT 


(CACGCG) 5 


59 


101 


0 


Unknown protein 


68 


PBM098 


FP097471 


GU644438 


CCCCGTCTTCTCGTCGTCT 
GACTTTGTCGGAGCCCTTGA 


(TCGCCG) 5 


56 


169 


0 


BAH_BAHCC1 superfamily 
protein 



species with a large genome (data unpublished) in which 
we developed 15 EST-SSR markers from 52 promising se- 
quences selected from 3406 non-redundant ESTs (Dong 
etal. 2011). 

We surveyed the allelic variability of the markers by 
genotyping 50 open-pollinated seedlings germinated 
from the year 2010 seedlot (Table 3). Among the 68 FL- 
cDNA SSR markers, only 22 (32.4%) showed polymorph- 
ism. The polymorphism information content (PIC) 
values of the 68 markers ranged from 0 to 0.51 with a 
mean value of 0.12. For the 22 polymorphic loci, the PIC 
values ranged from 0.19 to 0.51 with a mean value of 
0.36, and the top ten markers in terms of polymorphism 
were PBM075, PBM069, PBM095, PBM046, PBM066, 
PBM080, PBm087, PBM044, PBM084 and PBM091. SSR 
polymorphism in P. pubescens is much lower than ob- 
served in cereals (Thiel et al. 2003; Yu et al. 2004), coffee 
(Aggarwal et al. 2007) and the rubber tree (Feng et al. 
2009). Bamboo P. pubescens has a long flowering inter- 
val of more than 60 years (Janzen 1976; Watanabe et al. 
1982). Therefore, open pollination (DNA recombination) 
appears to have limited the amount of replication slippage, 
which diversifies SSR alleles (Richards and Sutherland 
1994; Jakupiak and Wells 1999). Clonal propagation in the 
interim periods of flowering has reduced the SSR diversity 
in bamboo (Nayak and Rout 2005). In a previous study, 
we discovered almost no allelic variation in the panel of 
11 varieties and 17 provenances of P. pubescens using 19 
GSS-SSRs (Tang et al. 2010). 

Interspecific transferability and polymorphism of P. pubescens 
FL-cDNA SSR markers 

Although more than 1000 bamboo species have been de- 
scribed, the vast majority of publically-available sequence 



data are derived from P. pubescens (Tang 2009). Therefore, 
the development of a set of transferable P. pubescens FL- 
cDNA SSR markers suitable for other bamboo species 
would help to accelerate genetic research and comparative 
genomics in the Bambusoideae subfamily. Previously, we 
developed 19 P. pubescens GSS-SSR markers and success- 
fully transferred them to six other Phyllostachys species 
with an average transferability of 75.3% and 66.7% poly- 
morphism (Tang et al. 2010). In B. arundinacea, 100% and 
83.3% transferability were achieved with 6 SSR markers in 
eight other Bambusa species and 10 species of other gen- 
era, respectively (Nayak and Rout 2005). In B. oldhamii, 
we achieved an average 59.6% transferability and 51.4% 
polymorphism with 15 markers in 14 bamboo species in- 
cluding four species within the same genus (Dong et al. 
2011). We tested the transferability and polymorphism of 
these 68 putative FL-cDNA SSR markers across 41 diverse 
species in six tribes of the Bambusoideae subfamily, as de- 
fined by Das et al. (2008) and Yang et al. (2008) (Additional 
file 2: Table SI and Additional file 3: Table S2). Successful 
amplification became less likely with increasing phylo- 
genetic distance from P. pubescens, with an 83.1% success 
rate within the genus Phyllostachys, a 79.4% success rate 
across genera within the subtribe Shibataeeae, and a 49.3% 
average success rate for other subtribes, ranging from 
36.8-76.5% (Table 4 and Figure 1). In contrast, the num- 
ber of markers showing polymorphism increased with 
phylogenetic distance, with 79.4% of markers showing 
polymorphism within the genus Phyllostachys, 91.3% 
showing polymorphism within the Shibataeeae, and 
92.8% showing polymorphism when comparing other 
subtribes. Markers in coding sequences were on average 
the most transferrable (69.1%) and the least polymorphic 
(89.4%), compared to those located in 5'-UTRs (63.4% 



Table 4 Transferability/polymorphism of P. pubescens FL-cDNA-derived SSR markers across species and genera in the 
Bambusoideae subfamily 



Types of 
EST-SSR 
(number) 


Intra-genus 
(Phyllostachys) 


Inter-genus within 
substribe (Shibataeeae) 


Inter-substribe 










Average 


Melocanninae 


Bambusinae 


Chusqueeae 


Arundinarieae 


Guaduinae 


5'-UTR (41) 


85.8%/81 .4% 


69.2%/91.3°/o 


42.7%/93.6% 


35.5%/944% 


41 .596/88.2% 


78.1%/91.4% 


29.3%/90.9% 


63.4%/90.7% 


ORF (18) 


80.696/71.3% 


70.196/91.0% 


58.3%/97.7% 


57.1%/95.7% 


61.196/90.9% 


75.6%/89.1% 


55.6%/95.8% 


69.194/89.4% 


3'-UTR (9) 


75.9%/86.6% 


72.2%/91.7% 


41.7%/95.0% 


46.0%/96.4% 


22296/1 00.0% 


70.9%/86.9% 


33.3%/1 00.0% 


61.8%/91.4% 


Average 


83.194/79.4% 


69.996/913% 


46.7%/94.9% 


42.6%/95.0% 


44.1%/90.5% 


76.5%/90.2% 


36.896/93.4% 
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Figure 1 Polyacrylamide gel electrophoresis bands representing microsatellites derived from FL-cDNA sequences, tested on a panel of 
selected bamboo species to evaluate transferability and polymorphism in locus of PBM042 (above) and PBM064 (nether). M: size 
marker. Mb, Cp, etc.: Bamboo species abbreviations are listed in Supplementary Table 1. 



transferrable, 90.7% polymorphic) and 3'-UTRs (61.8% 
transferrable, 91.4% polymorphic). These trends were ex- 
acerbated with increasing phylogenetic distance. These 
matches the results from a metastudy of 601 loci in 35 
plant species showing an average 89.8% transferability at 
the subgenus level, 76.4% at the genus level and 35.2% at 
the family level (Rossetto 2001). Interestingly, more than 
17 (25%) of the markers were transferrable to more than 
85% of the tested species (Additional file 3: Table S2). This 
success rate suggests that FL-cDNA SSRs and their 
flanking regions are sufficiently conserved (Zhang et al. 
2005), and it is therefore possible to transfer P. pubescens 
FL-cDNA SSR markers to other bamboo species for 
evolutionary studies and phylogenetic reconstructions 
(Sharma et al. 2008). 

Using polymorphic FL-cDNA SSR markers to estimate 
outcrossing rates and identify interspecific bamboo 
hybrids 

Sexual propagation increases genetic diversity by creating 
progenies of different genotypes through recombination 
(i.e. outcrossing). This is advantageous for predominantly 
clonal plants such as most bamboo species, which rely 
mostly on vegetative regeneration interspersed with occa- 
sional flowering (Janzen 1976). The analysis of the repro- 
ductive system is therefore fundamental to elucidate 
primary genetic diversity and the structure of regenerating 
bamboo populations, and to adopt strategies for genetic 
improvement. Previous studies on the bamboo reproduct- 
ive system based on field data and artificial pollination 



showed that self-compatibility is predominant in Sasa spe- 
cies (Nishiwaki and Konno 1990), and the selfing rate 
could approach 0.99 in Merostachys riedeliana (Guilherme 
and Ressel 2001). Outcrossing rate was estimated using 
SSR-based analysis as reported in S. cernua (Kitamura and 
Kawahara 2011). 

Among the 22 polymorphic SSR markers described 
above, the ten most polymorphic (PIC > 0.36) were used to 
detect polymorphisms in 50 open-pollinated half-sib 
seeds (year 2011) from three flowering sites in the Guangxi 
Province separated by at least 100 km. Polymorphism in 
the PBM044, PBM080 and PBM095 loci was identical 
in the seeds from all three flowering sites, whereas 
PBM084 and PBM091 featured additional alleles from 
Lipu, PBM069, PBM075, PBM087 and PBM091 fea- 
tured additional alleles from Lingchuan, and PBM069, 
PBM075 and PBM084 featured additional alleles from 
Guanyang (Table 5). This indicated that flowering culms 
in different sites featured diverse SSR genotypes and pro- 
duced genetically-diverse half-sib seed sources. Therefore, 
we used these eight polymorphic loci to estimate the out- 
crossing rates and other related genetic parameters for 
P. pubescens (Table 5). The overall estimates of tm and ts 
for three culms were 0.089 for both parameters, with no 
standard deviation. The estimates for individual culms 
showed small differences of 0.067 in Lipu and Lingchuan, 
and 0.133 in Guanyan, again for both parameters. Estima- 
tion of F is for the overall population was 0.195, indicating 
homozygote excess. We found that the outcrossing rate 
was 0.089, estimated from eight polymorphic multilocus 



Table 5 Seed number, estimated outcrossing rates and relative parameters for each of three P. pubescens flowering 
culms at 8 loci 



Flowering site 
(county) 


N 


tm 


ts 


Fis 








Genotype 




PBM044 


PBM069 


PBM075 


PBM080 PBM084 PBM087 PBM091 


PBM095 


Lipu 


so 


0.067 (0.0) 


0.066 (0.0) 


0.182 


P 






P P P 


P 


Lingchuan 


50 


0.067 (0.0) 


0.067 (0.0) 


0.173 


P 


P 


P 


P P P 


P 


Guanyan 


50 


0.133 (0.0) 


0.135 (0.0) 


0.231 


P 


P 


P 


P P 


P 


Average 




0.089 (0.0) 


0.089 (0.0) 


0.195 













N the number of analyzed seeds; tm multi-locus outcrossing rate and standard error in parentheses; ts, single-locus outcrossing rate and standard error 
in parentheses. Fis inbreeding coefficient; p polymorphism. 
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datasets in P. pubescens, which is slight lower than the 
0.148 reported in S. cernua using six multilocus SSR data- 
sets (Kitamura and Kawahara 2011). This indicated that 
the reproductive system of P. pubescens predominantly 
involves self-fertilization with an adequate proportion 
of crossing to ensure genetic diversity as reported for 
S. cernua (Kitamura and Kawahara 2011). 

The grow-out test for bamboo interspecific hybrids is 
time-consuming and laborious because it involves grow- 
ing plants to maturity (which takes at least 5 years), 
assessing several anatomical, morphological and floral 
(long-term interval) characteristics that distinguish the 
hybrid. The polymorphic SSR markers could also help in 
the rapid and accurate identification of interspecies hy- 
brids, as reported in poplar (Rajora and Rahman 2003) and 
wheat-barley (Malysheva et al. 2003). To obtain proof of 
principle that our novel SSR markers are suitable for hy- 
brid characterization, we next selected several highly- 
transferable and polymorphic FL-cDNA SSR markers. 
PBM032, PMB049, PMB063 and PMB064, each with a 
number of species-restricted alleles, were used to test 
uncharacterized bamboo samples. Marker PMB063 identi- 
fied the parental species in one hybrid as P. kwangsiensis 
and P. bambusoides, because all sequenced bands con- 
tained the (TCCA) n motif although with a variable number 
of repeats (Figure 2). Similarly, marker PMB064 identified 
the parental species B. pervariabilis and Dendrocalamus 
latiflorus which are distantly related to P. pubescens, with 
a variable number of repeats in the (GAGT) n motif 
(Figure 3). As previously shown using GSS-SSR markers, 
such high levels of transferability and polymorphism 
within the Bambusoideae subfamily should allow the use 
of FL-cDNA SSR markers to identify interspecific hybrids 
and their parents, both within the genus Phyllostachys 
(Tang et al. 2010) and in more distant taxa within subtribe 
of Shibataeeae (Lu et al. 2009). We have also developed 
several putative EST-SSR markers in B. oldhamii and have 
used these to identify some other sympodial bamboo 
interspecies hybrids (Wu et al. 2009; Dong et al. 
2011). The SSR markers developed in the present study 
were used to identify not only interspecific hybrids from 



monopodial Phyllostachys but also intergeneric hybrids 
with sympodial rhizomes, which are distantly related to 
P. pubescens. Our data confirmed that microsatellites, 
especially SSR markers based on cDNAs and ESTs, 
are ideal for the identification of bamboo interspecies 
hybrids. 

Conclusions 

Our data provide insight into the association between 
SSRs and TEs in FL-cDNAs from the P. pubescens tran- 
scriptome, allowing us to develop and evaluate 68 FL- 
cDNA SSR markers that can be used in P. pubescens and 
partially for many other bamboo species, to estimate the 
reproductive system of P. pubescens and identify several 
interspecific hybrids. These FL-cDNA SSR markers en- 
rich the molecular marker resources currently available 
for bamboo. When a large set of polymorphic markers 
becomes available, we can use genome-wide association 
mapping in bamboo, in the absence of structured popu- 
lations, to identify markers for traits of interest that can 
be used for marker-assisted selection in the Bambusoi- 
deae subfamily. 

Methods 

Full-length cDNA mining and SSR/TE detection 

We obtained 10,608 FL-cDNA sequences from NCBI 
Entrez (http://www.ncbi.nlm.nih.gov/) on July 1, 2010. 
These cDNA sequences were assembled from five cDNA 
libraries constructed from breaking-out shoots, young 
(40-cm) shoots and young leaves from plants, and shoots 
and roots from germinated seeds (Peng et al. 2010). We 
used EST Trimmer (http://pgrc.ipk-gatersleben.de/misa/ 
download/est_trimmer.pl) to remove poly(A/T) runs 
from the 5 ' and 3 ' ends until there were no occurrences 
of (T) 5 or (A) 5 within a 50-bp range. Redundant sequences 
were eliminated and overlapping sequences were spliced 
together using CAP3 (http://seq.cs.iastate.edu/cap3.html) 
(Huang and Madan 1999). 

After pre-treatment, we used MISA (http://pgrc.ipk- 
gatersleben.de/misa/misa.html) to screen for SSRs includ- 
ing mononucleotide repeats >10 bp in length, dinucleotide 



A 



1 .1 ( i t C ( U I ( C U J C » ( { A CCAtCCTC I t C C i t C U C A 1 U C 




,i, , -1CACCAACCAATCCI 

n-i . , i 

M-» . . C 



IK US 

TCCAO 
TCC.H3 



t C U M C C U C U K H U ( H c t t c c c c c 
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m4 

F tttrtutoidri -1 



B 



Figure 2 A, Microsatellite DNA fingerprints of P. kwangsiensis (line 1), P. bambusoides (line 3) and a presumed hybrid (line 2) at locus 
PBM063. B, Alignment of the nucleotide sequences of the microsatellite alleles at locus PBM014 amplified from P. kwangsiensis, P. bambusoides 
and two presumed hybrids. Nucleotides conserved among these sequences (relative to P. kwangsiensis) are shown by dots. The lines indicate the 
primer sequences used to amplify this microsatellite locus. The box highlights the microsatellite. The suffix numbers after bamboo species 
correspond to the DNA bands marked in part (a). 
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Figure 3 A, Microsatellite DNA fingerprints of Bambusa pervariabilis, Dendrocalamus latiflorus and their presumed hybrids at locus 
PBM064. B, Alignment of the nucleotide sequences of the microsatellite alleles at locus PBM064 amplified from B. pervariabilis, D. latiflorus and 
their presumed hybrids. Nucleotides conserved among these sequences (relative to 8. pervariabilis) are shown by dots. The lines indicate the 
primer sequences used to amplify this microsatellite locus. The box highlights the microsatellite. The suffix numbers after the bamboo species 
correspond to the DNA bands marked in part (A). 



to hexanucleotide repeats with >6 repeat units, and inter- 
rupted composite SSRs with <100 bp of intervening DNA. 
Putative annotations were assigned to non-redundant 
ESTs containing SSRs using BLAST against the Moso 
Bamboo cDNA Database (http://202.127.18.228/mbcd/) 
and the Gramene Ontologies Database (http://archive. 
gramene.org/plant_ontology/). TEs were identified using 
RepeatMasker and RepeatProteinMask (http://www. 
repeatmasker.org) based on similar elements present in 
the rice genome, and SSRs within TEs were screened using 
MIS A with the same parameters as above. Additional file 1: 
Figure SI provides a flow chart for the data mining and 
marker development process. 

Plant material and DNA extraction 

We used P. pubescens samples collected from the Anji 
Bamboo Germplasm Garden, Anji, Zhejiang Province, to 
identify and characterize putative FL-cDNA SSR markers. 
The polymorphism of these SSR markers was evaluated 
using 50 seedlings germinated from an open-pollinated see- 
dlots (mixed seed sources, mainly from different flowering 
sites in the counties of Lipu, Lingchuan and Guanyang, 
Guangxi Province in the year 2010). Another 50 seedlings 
were germinated from open-pollinated half-sib seeds (year 
2011) from three flowering culms in the same three coun- 
ties (>100 km between sites) and were used to estimate 
the P. pubescens outcrossing rate. We obtained 41 repre- 
sentative bamboo species from 38 genera within six sub- 
tribes mainly found in China to test the transferability and 
polymorphism of the FL-cDNA SSR markers (Additional 
file 2: Table SI). We obtained three Phyllostachys interspe- 
cific hybrids from Jiangxi Province, China, and two inter- 
generic hybrids from Yoshinaka Bamboo Germplasm 
Garden, Fukuoka, Japan, for the hybrid identification tests. 
Genomic DNA was extracted from young leaves using the 
hexadecyltrimethylammonium bromide (CTAB) method 
(Doyle and Doyle 1987), with some modifications. 

Amplification and sequencing of SSR loci 

Primer pairs designed according to the available cDNA se- 
quences were synthesized by Shanghai Sangon Biological 



Engineering Technology & Services Co., Ltd. P. pubescens 
DNA was amplified in 20- ul reactions comprising 50- 
100 ng of template DNA, 0.2 ujM of each primer, 200 uM 
of each dNTP and 1 unit of Taq DNA polymerase with lx 
PCR universal buffer (10 uM Tris-HCl, pH 8.3 at 25°C; 
50 uM KC1), and 1.5 uM MgC _ (Shanghai Sangon Bio- 
logical Engineering Technology & Services Co., Ltd). 
The reaction was heated to 95°C for 5 min using an ABI 
PE9700 thermocycler, followed by 30 cycles of 1 min de- 
naturation at 95°C, 1 min annealing at 46-59°C depend- 
ing on the primer pair (Table 3), and 2 min extension at 
72°C, followed by a final hold at 72°C for 5 min. Ampli- 
fied microsatellite loci were tested in 41 diverse species 
in six tribes of the Bambusoideae subfamily (Table 4) and 
interspecific hybrids (Figures 2 and 3). The annealing 
temperature was lowered by 2-5°C according to the 
evolutionary distance between species based on mo- 
lecular markers (Das et al. 2008) and nuclear and 
chloroplast sequences (Yang et al. 2008), as suggested 
by Rossetto (2001). PCR products were separated on 
6% polyacrylamide denaturing gels, and marker bands 
were revealed by silver staining as described by Panaud 
et al. (1996). Specific bands were excised directly from the 
silver staining polyacrylamide gel, purified using the 
EZ-10 Spin Column DNA Gel Extraction Kit (Biobasic 
Inc.) and ligated into the pUC18 vector (TaKaRa, 
Japan). Three positive clones for each bamboo species 
were selected for sequencing using BigDye terminator 
V3.1 in a cycle sequencing protocol according to the man- 
ufacturer's specifications (PE Applied Biosystems, ABI 
PRISM 3100-Avant Automatic DNA Sequencer). Vec- 
tor sequences were removed then edited using Vector 
NTI software (version 10.0, Invitrogen Co., USA). Se- 
quences were deposited in NCBI GenBank (accession nos 
GU644371-GU644438). 

Data analysis 

The polymorphism information content (PIC) (Botstein 
et al. 1980) of our SSR markers was determined using 
Powermarker v3.25 (Liu and Muse 2005). All 68 selected 
primer pairs were used to amplify template DNA from 
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41 bamboo species covering 35 genera in six subtribes 
(Additional file 2: Table SI) and the statistical methods 
of Nayak and Rout (2005) and Sharma et al. (2009) were 
used to calculate the cross-taxon transferability and poly- 
morphism (Additional file 3: Table S2), in which poly- 
morphism is calculated only from the loci that were 
successfully transferred across taxa (Rossetto 2001). Single 
locus and multilocus outcrossing rates and relative param- 
eters were analyzed separately under the mixed mating 
model of Ritland & Jain (1981) and Ritland (2002), imple- 
mented using MLTR v3.4 (Ritland 1996). 

Additional files 



Additional file 1: Figure SI. Scheme used for database mining and 
the deveiopment of SSR markers from P. pubescens FL-cDNA sequences. 

Additional file 2: Table SI. Species used to test cross species/genus 
amplification of P. pubescens FL-cDNA SSR loci. 

Additional file 3: Table S2. Cross species/genus amplification of 
P. pubescens FL-cDNA SSR loci. 
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